Replication Instructions

This replication package includes all the code necessary to reproduce the results presented in the paper. All data processing and analysis were conducted using Stata 17.

The data used in this study come from two main sources: (i) records from the private health insurance company SURA, with access provided by the Secretariat of Health of Cali, and (ii) additional administrative data obtained directly from the Secretariat. The main dataset, HPV_Cali_database.dta, contains anonymized information on 15,231 girls whose parents are affiliated with SURA. It includes vaccination records, sociodemographic variables, and treatment assignment indicators. Only variables relevant to the analysis are included. 

Due to privacy restrictions, these data are not publicly available. Researchers interested in accessing the dataset may contact the Secretariat of Health of Cali. The authors had authorized access and permission to use the data for this study.

To replicate the results, first install the required package by running: ssc install randtreat. Then run the main script, VPH_Cali_WP_final.do, which processes the data and generates all four figures and nine tables included in the paper.
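
As a minimal sketch, the replication steps might look like the following in a Stata 17 session. The folder path is a hypothetical placeholder, and the sketch assumes the main script and HPV_Cali_database.dta are stored where the script expects them. Adjust both to match your setup.

```stata
* Assumption: hypothetical folder where the replication package was saved;
* replace with the actual location on your machine.
cd "C:/replication/HPV_Cali"

* Install the user-written package required by the scripts (needed once).
ssc install randtreat

* Run the main script. The file name is quoted so that Stata treats it
* as a single argument.
do "VPH_Cali_WP_final.do"
```

Quoting file names matters most for the additional do-files, whose names contain spaces. For example, do "Table 2 - main model.do" would fail without the quotes.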

The folder also includes four additional do-files:
- Table 2 - main model.do
- Table A2 - het dose.do
- Table A3 - het age.do
- Table A4 - het income.do
The main script executes these do-files automatically. They produce LaTeX-ready (.tex) files for inclusion in the manuscript.

All code for data analysis is provided as part of this replication package.

